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(54) Incorporating predictive models within interactive business analysis processes 



(57) A Customer Relationship Management (CRM) 
system that incorporates predictive models. The system 
is used by business users who are unfamiliar with the 
art of data mining. The predictive model, which is con- 



structed by a model-building mechanism in a data min- 
ing subsystem, accepts the appropriate input attributes, 
performs calculations against a segment comprised of 
records, and generates an output attribute. 
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Description 

[0001] This invention relates in general to database 
management systems performed by computers, and in 
particular, to the incorporation of predictive models with- 
in a Customer Relationship Management (CRM) system 
that uses a Relational Database Management System 
(RDBMS). 

[0002] In a data warehouse environment, the function 
of interactive business analysis is supported by a variety 
of applications and tools, including RDBMS (Relational 
DataBase Management System) and OLAP (On-Line 
Analytic Processing) tools. Typically, such business 
analysis tools use metadata to represent business con- 
cepts, and to provide a mapping from the business con- 
cepts to data stored in the data warehouse. A business 
analyst can then use familiar business terms to request 
an analytic function, and the tool will convert the busi- 
ness terms to the appropriate Table/Column names, and 
generate and then execute the appropriate SQL to per- 
form the function. Thus, the analyst can request a report 
on "Sales" for "Eastern Region," where "Sales" is a 
"Measure" and "Eastern Region" is a "Segment." In this 
example, Segments and Measures are types of meta- 
data, wherein Measures are values or expressions that 
are useful in reviewing, analyzing or reporting on data 
elements represented by segments. 
[0003] The Measures supported by current analytic 
tools are either simple aggregations (e.g., "Revenues"), 
or more complex derivations based on formulas (e.g., 
"Margin" = "Revenues - Costs," and "Percent Margin" = 
"Margin / Revenues * 1 00"). In each case, the definitions 
for the Measures is provided by a human, e.g., by a busi- 
ness analyst during a set-up process that occurs follow- 
ing installation but prior to execution of the tool. Usually, 
the definitions for the Measures comprise a manual 
metadata definition process. 

[0004] A Measure might be predictive, e.g., rather 
than measure past performance or behavior, it might 
predict future performance or behavior, typically in the 
form of a propensity score. For example, it might predict 
the propensity of a Customer Segment to purchase a 
product or to terminate service. The formula for a pre- 
dictive Measure might be provided by a human, based 
on prior experience or intuition. 
[0005] A more rigorous approach would be to use a 
predictive modeling system, the output of which is typi- 
cally a predictive model which may or may not be in 
some executable form. Typically, in order to use such a 
model as a predictive Measure in a business analysis 
tool, it would be necessary for a human to translate the 
predictive model formula into an appropriate form (e.g., 
as SQL statements) that can be processed by the busi- 
ness analysis tool. 

[0006] Unfortunately, predictive modeling systems 
are technically complex, and require a high level of sta- 
tistical and data mining skills to create successful mod- 
els, including knowledge about the algorithms involved 



and how they operate. They also typically require expert 
knowledge of the data involved in the prediction, and 
programming skills in order to manipulate the data into 
a form that the predictive modeling system requires. 
s [0007] There is a need then to make data mining al- 
gorithms more accessible and more available to busi- 
ness people. There is a strong requirement for market- 
ing campaign planners with modest technical skills (e. 
g., business analysts) to be able to incorporate predic- 
10 tive models directly in support of their business tasks. 
[0008] From a first aspect, the present invention re- 
sides in a method for using predictive models within a 
computer-implemented business analysis environment, 
comprising applying a derived measure against a seg- 
ment, wherein the derived measure comprises a predic- 
tive model that previously-built by a model-building 
mechanism in a data mining system and generating out- 
put for the segment from the predictive model in the form 
of measure values. 

[0009] From a second aspect, the invention resides 
in a computer-implemented system for using predictive 
models within a computer-implemented business anal- 
ysis environment, comprising means for applying a de- 
rived measure against a segment, wherein the derived 
measure comprises a predictive model that previously- 
built by a model-building mechanism in a data mining 
system and means for generating output for the seg- 
ment from the predictive model in the form of measure 
values. 

[0010] The derived measure is preferably invoked 
within an application template, the application template 
comprising a sequence of elements linked together in a 
workflow, and the elements are selected from a group 
comprising a segment, a filter, a measure and a function. 
The application template is constructed in a visual pro- 
gramming environment and preferably can be reused 
and/or modified by users. 

[0011] A segment is preferably defined as a grouping 
of data elements from a database organized about one 
or more attributes, a filter defines one or more attribute 
constraints applied to a segment, a profile is a labeled 
collection of attributes of a segment and a measure is 
an expression applied to a segment. 
[001 2] The computer-implemented business analysis 
environment preferably includes an object model, and 
the segments, attributes, filters, and measures com- 
prise objects, operations upon the objects preferably be- 
ing translated into SQL statements that access corre- 
sponding tables and columns in a relational database. 
The predictive model may comprise one or more SQL 
statements that access tables and columns in a relation- 
al database or may comprise one or more statements 
executed by a database management system. These 
statements preferably access data stored in the data- 
base management system. 

[0013] The model-building mechanism preferably 
comprises an analytic algorithm for rule induction per- 
formed against data stored in a database management 
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system to create the predictive modeL 
[001 4] The invention may also be expressed as an ar- 
ticle including one or more machine-readable storage 
media containing instructions for using predictive mod- 
els within a computer-implemented business analysis s 
environment, as a data structure stored in memory of a 
computer comprising instructions for using predictive 
models within a computer-implemented business anal- 
ysis environment and as a computer program compris- 
ing program instructions for using predictive models 10 
within a computer-implemented business analysis envi- 
ronment, the instructions when executed by a computer 
system causing the system to perform the method de- 
scribed above. 

[0015] The present invention provides a mechanism 15 
for incorporating predictive models in an interactive 
business analysis environment, where an analyst uses 
business terms to accomplish his or her tasks, and 
where the system translates the functions requested by 
the analyst into SQL statements and then executes 20 
these SQL statements against a relational database. A 
previously-generated predictive model is accessed by 
the user in the form of a Measure, and is applied against 
a target Segment, thereby achieving the effect of dy- 
namically characterizing the Segment based on the set 25 
of attributes presented with it. This can be used to dy- 
namically score customers based on current informa- 
tion. The present invention also exploits Application 
Templates to reuse expert knowledge gained from prior 
model-building experiences, making such knowledge 30 
available to lower-skilled business analysts such that 
they can successfully build predictive models. The 
present invention also exploits visual programming 
techniques for constructing and representing the steps 
in the Application Templates. 35 
[001 6] An embodiment of the invention will now be de- 
scribed with reference to the accompanying drawings in 
which like reference numbers represent corresponding 
parts throughout: 

40 

FIG. 1 illustrates an exemplary hardware and soft- 
ware environment that could be used with the 
present invention; 

FIG. 2 illustrates an exemplary architecture for the 
Customer Relationship Management (CRM) Server 
according to the preferred embodiment of the 
present invention; 

FIG. 3 illustrates a graphical user interface (GUI) of 
the Customer Relationship Management (CRM) 
Client according to the preferred embodiment of the so 
present invention; 

FIG. 4 illustrates an Application Template used to 
incorporate a predictive model according to the pre- 
ferred embodiment of the present invention; and 
FIG. 5 is a flowchart that illustrates the logic used 55 
to incorporate a predictive model according to the 
preferred embodiment of the present invention. 



DEFINITIONS 

[0017] In the context of this application, the following 
terminology is used: 

Segment: A Segment is a grouping of data ele- 
ments organized about one or more attributes. Seg- 
ments may be subdivided into Sub-Segments 
based on Attributes or Filters, which may be cate- 
gorical (such as "Type of Residence," "Marital Sta- 
tus," or "Brand"), numeric (such as "Age > 65," or 
"Price >$25"), etc. Sub-Segments themselves can 
be further subdivided into Sub-Segments. 
Filter: A Filter defines one or more attribute con- 
straints applied to a Segment or Sub-Segment, usu- 
ally to create a Sub-Segment. For example, a Seg- 
ment "California Customers" may be constrained to 
"Female California Customers" by applying a Filter 
for gender. 

Profile: A Profile is a collection of attributes relating 
to a Segment. For example, a Demographic Profile 
might include those attributes of a Customer Seg- 
ment that contain demographic information, such 
as gender, zip code, marital status, household 
group, home ownership, vehicle ownership, etc. 
Pre-defined Profiles may be available, as is the abil- 
ity to create ad hoc profiles from the available set 
of attributes for a Segment. 
Measure: A Measure is a formula applied against a 
Segment or Sub-Segment. A Measure may involve 
simply aggregating values retrieved from a data- 
base, computing a formula, or executing a previous- 
ly-built predictive model. 

Function: A Function is a control for splitting, merg- 
ing, and branching. 

Application Template: An Application Template is a 
sequence of Segments, Filters, Measures and 
Functions linked together in a workflow, wherein ar- 
rows indicate the flow of data. 

ENVIRONMENT 

[0018] FIG. 1 illustrates an exemplary environment 
that could be used with the preferred embodiment of the 
present invention. In the exemplary environment, a 
computer-implemented Customer Relationship Man- 
agement (CRM) system 100 comprises a three-tier cli- 
ent-server architecture, wherein the first tier comprises 
a CRM Client 102 that provides a graphical user inter- 
face (GUI) or other application, the second tier compris- 
es a CAM Server 1 04 that provides a framework for ex- 
ecuting CRM applications, and the third tier comprises 
a Relational DataBase Management System (RDBMS) 
server 106 that manages a relational database 108 
(which includes both data and metadata). 
[0019] In the preferred embodiment, the CRM Client 
102, CRM Server 104, RDBMS 106, and relational da- 
tabase 108 each comprise logic and/or data tangibly 
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embodied in and/or accessible from a device, media, 
carrier, or signal, such as RAM, ROM, one or more of 
the data storage devices, and/or one or more remote 
systems or devices communicating with the CRM sys- 
tem 1 00 via one or more data communications devices. 
[0020] However, those skilled in the art will recognize 
that the exemplary environment illustrated in FIG. 1 is 
not intended to limit the present invention. Indeed, those 
skilled in the art will recognize that other alternative en- 
vironments may be used without departing from the 
scope of the present invention. In addition, it should be 
understood that the present invention may also apply to 
components other than those disclosed herein. 

CUSTOMER RELATIONSHIP MANAGEMENT 
SERVER 

[0021] FIG. 2 illustrates an exemplary architecture for 
the CRM Server 1 04 according to the preferred embod- 
iment of the present invention. The CRM Server 104 
provides a common, adaptable, and extensible platform 
for the development of CRM applications. It uses an ob- 
ject modeling concept to transform data stored in the 
relational database 1 08 into an object model to be used 
within the CRM Server 104 and CRM Client 102. This 
allows the CRM Server 1 04 and CRM Client 1 02 to focus 
on the object model and not be concerned with the SQL 
statements required to access the relational database 
108. 

[0022] In the exemplary architecture, the CRM Server 
104 provides a framework for executing CRM applica- 
tions, and includes a Segment Manager 200, Measure 
Manager 202, Functions 204, and Application Tem- 
plates 206, as well as other components. The CRM 
Server 1 04 also includes an Object Manager 208 that 
interfaces to the RDBMS 1 06, wherein the Object Man- 
ager 208 includes functions related to SQL Generation 
210, SQL Execution 212, and an application program- 
ming interface (API) 21 4 to the TeraMiner™ product pro- 
vided by NCR Corporation, the assignee of the present 
invention. 

Metadata 

[0023] The functions performed by the CRM Server 
104 and its components are metadata-driven, wherein 
the metadata describes one or more "Business Models" 
and one or more "Business Rules." Metadata defines 
the mapping between objects instantiated in the CRM 
Server 104 and the data stored in the relational data- 
base 108. In this regard, the metadata supports the 
mapping of Segments, Attributes, Filters, and Measures 
to the relational database 108. 

Segment Manager 

[0024] The Segment Manager 200 provides a com- 
mon segmentation engine for use by the other elements 



of the CRM Server 104. The Segment Manager 200 
supports user requests, such as defining Segments, ap- 
plying Measures and Filters to Segments, profiling Seg- 
ments, saving Segments, and displaying Segments, as 
5 well as merging Segments, removing duplicate entries 
from Segments, deleting Segments, etc. Users can 
choose from a set of pre-defined Segments or create ad 
hoc Segments. 

10 Measure Manager 

[0025] The Measure Manager 202 provides a com- 
mon measurement engine for use by the other elements 
of the CRM Server 1 04. A Measure is a value or expres- 
sion applied to a Segment. A Measure may be a simple 
base measurement (e.g., mapped directly to a field of a 
table in the relational database 1 08 or calculated on one 
or more such fields), or a compound derivative meas- 
urement (a calculation involving one or more base 
20 measurements). Users can choose from a set of pre- 
defined Measures or create ad hoc Measures. 

Functions 

25 [0026] A Function 204 is a module that provides a 
control service in an Application Template 206. A Func- 
tion 204 may be a standardized programming element 
that allows the user to control the sequence of steps, or 
a customized programming element. 

30 

Application Templates 

[0027] An Application Template 206 is a sequence of 
Segments, Filters, Measures and Functions linked to- 

35 gether in a workflow. The sequence that comprise a 
workflow in the Application Templates 206 are repre- 
sented as icons that are linked together, wherein the di- 
rection of the connecting arrows determine the se- 
quence of execution and the flow of data. 

40 [0026] Users can choose from a set of pre-defined Ap- 
plication Templates 206 or create ad hoc Application 
Templates 206, as desired. In either case, the user can 
create and manipulate the Application Template 206. 
Further, users can add, modify and delete the steps 

4 $ within the Application Template 206. 

Object Manager 

[0029] The Object Manager 208 instantiates data 
so from the relational database 108 via the RDBMS 106, 
and wraps that data in objects for use by the Segment 
Manager 200, Measure Manager 202, Functions 204, 
Application Templates 206, and other elements of the 
CRM Server 104. The Object Manager 208 interprets 
55 the metadata, generates the necessary SQL statements 
in module 210, and then executes the generated SQL 
statements in module 212. Object classes that repre- 
sent business models can be mapped to tables in the 
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relational database 1 08, thereby providing an object-ori- 
ented (00) representation of the relational database 
108. This provides a certain degree of independence 
from the schema of the relational database 108. 

TeraMiner™ 

[0030] The Object Manager 208 also provides a mod- 
ule 214 that accesses an application programming in- 
terface (API) to the TeraMiner™ product provided by 
NCR Corporation, the assignee of the present invention. 
The TeraMiner™ product is further described in the co- 
pending and commonly-assigned Application Serial No. 
09/410,530, filed on October 1 , 1999, by Todd M. Brye, 
entitled "SQL-Based Analytic Algorithm for Rule Induc- 
tion," attorney's docket number 8221 , which application 
is incorporated by reference herein. 
[0031] TeraMiner™ provides functionality that allows 
the RDBMS 106 to support data mining operations 
against the relational database 1 08. In TeraMiner™, ad- 
vanced analytic processing capabilities for data mining 
applications are placed where they belong, i.e. , close to 
the data. Moreover, the results of these analytic 
processing capabilities can be made to persist within the 
database 108 or can be exported from the database 
1 08. These analytic processing capabilities and their re- 
sults are exposed externally to the RDBMS 106 by the 
application programmable interface (API). 

CRM Client 

[0032] FIG. 3 illustrates a graphical user interface 
(GUI) 300 of the CRM Client 102 according to the pre- 
ferred embodiment of the present invention. The CRM 
Client 102 is a web browser application that provides a 
"workbench" that uses a visual programming metaphor 
to assist the user in creating, modifying, deleting, and 
executing Application Templates 206. 
[0033] The GUI of the CRM Client 1 02 is divided into 
two panes 302 and 304, with a menu bar 306 and tool 
bar 308 above the panes 302 and 304. 
[0034] With regard to the menu bar 306, from left to 
right, the menus comprise "File," "View," and "Help" 
menus. These menus and their associated functions are 
similar to those found in any application. 
[0035] With regard to the tool bar 308, from left to 
right, the icons represent the functions: "save current 
template," "segment," "filter," "measure," "conditional," 
"de-duplicating," "merge," "random N," "random %," 
"split," "top N," "top %," and "link." These functions are 
described in more detail below: 

• The "save current template" function saves the Ap- 
plication Template 206 displayed in pane 304 to a 
specified storage location, either on the CRM Client 
102 or CRM Server 104. 

• The "segment" function allows the user to perform 
a number of different tasks related to segments, in- 



cluding: Get a Segment, Persist a Segment, etc. 

• The "filter" function allows the user to perform a 
number of different tasks related to Filters, includ- 
ing: Constrain a Segment (reduce its size based on 
Attribute selection criteria, e.g., "Disposable In- 
come >$40000"), Profile a Segment (retrieve a pre- 
defined set of its Attributes), etc. 

• The "measure" function allows the user to perform 
a number of different tasks related to Measures, in- 
cluding: Apply Measures to Segment (e.g., calcu- 
late "Profitability" for "California Customers"), De- 
fine a Measure (which includes a "Define a Derived 
Measure"), etc. 

• The "conditional" function causes a conditional 
branch on a stream. Conditional branching is based 
on values of a Measure (e.g., If Value = Top 20% 
Do A, Else Do B). Branching can be a binary deci- 
sion or multiple choice decision, similar to a case 
statement. Normally, conditional branches will join 
back into a single flow. 

• The "de-duplicating" function refers to de-duplica- 
tion and removes all duplicate records (i.e., rows 
having the same primary key value) in a merged 
Segment. 

• The "merge" function merges back a previously split 
workflow into a single stream. Two types of "merge" 
are supported: (1) "classic merge" which when 
merging a Segment X with attributes A and B with 
Segment Y with attributes B and C, will return a Seg- 
ment Z with attribute B, and (2) "merge with de- 
dupe" which when merging a Segment X with at- 
tributes A and B with Segment Y with attributes B 
and C, will return Segment Z with attributes A, B, 
and C. All duplicates of attribute B will be removed. 
Also, the merge function requires that the two Seg- 
ments being merged come from the same top level 
Segment. 

• The "random N" function returns a random sample 
of size N of the Segment. 

• The "random %" function returns a random sample 
of size N percentile of the Segment. 

• The "split" function causes the workflow to branch 
out into two or more separate, concurrent flows 
based on the value of a Measure or Attribute. 

• The "top N" function returns the top N number from 
the Segment. 

• The 'lop %" function returns the top N percentile 
from the Segment. 

• The "link" function links two steps in the workflow 
together. 

[0036] In the left hand pane 302, a tree display shows 
a hierarchy comprised of a Workbench level having a 
subordinate Analysis Manager level. The Analysis Man- 
ager includes subordinate levels comprised of Tem- 
plates, Segments, Filters, Measures, and Functions. 
Each of these subordinate levels includes zero or more 
copies of the specified components that have been cre- 
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ated and are available for reuse. New copies of the 
specified components can be created using a "New" 
function from the "File" menu. 
[0037] The icons from the left hand pane 302 or the 
tool bar 308 may be "dragged and dropped" onto the 
right hand pane 304 and then linked together to create 
the sequence of steps that comprise the workflow of an 
Application Template 206. The sequence of the icons 
and the direction of the connecting arrows determine the 
sequence of execution and data flow. 
[0038] Users can choose from a set of pre-defined Ap- 
plication Templates 206 from the tree display of the left- 
hand pane 302 or create ad hoc Application Templates 
206, as desired. In either case, the user can create and 
manipulate the Application Template 206. Further, users 
can add, modify and delete the steps (icons) within the 
Application Template 206. 

Incorporating Predictive Models 

[0039] In the preferred embodiment, the CRM Client 
102 can invoke an analytic algorithm for rule induction 
via the module 214 of the Object Manager 208 to per- 
form the "Define a Derived Measure" task of the "meas- 
ure" function, thereby creating a predictive model, in- 
stead of the user providing the definition of the Measure 
(in the form of a formula). The module 214 accesses or 
invokes the application programming interface (API) to 
theTeraMiner™ product to invoke the analytic algorithm 
for rule induction . This analytic algorithm for rule induc- 
tion is further described in the co-pending and common- 
ly-assigned Application Serial No. 09/410,530, filed on 
October 1 , 1999, by Todd M. Brye, entitled "SQL-Based 
Analytic Algorithm for Rule Induction," attorney's docket 
number 8221, which application is incorporated by ref- 
erence herein. 

[0040] The analytic algorithm for rule induction pro- 
vided by the TeraMiner™ product builds a predictive 
model based on the Segment and related Attributes pre- 
sented to it, and returns the model as the new Measure. 
The Measure (i.e., the model) can then be used against 
other Segments with matching Attribute definitions (us- 
ing the "Apply Measures to Segment" task of the "seg- 
ment' function), for test purposes and later for deploy- 
ment purposes. 

[0041] The use of the predictive model is illustrated in 
the workflow of FIG. 4, described further below, which 
represents an Application Template 206 that might be 
used in a customer retention campaign (i.e., using a pre- 
dictive model that will identify those customers that have 
a high likelihood for attrition). This workflow is a simpli- 
fied subset of a deployment phase of a standard data 
mining process. 

[0042] In order to produce the best results from the 
workflow illustrated below, it would be necessary for a 
skilled user to have previously built a predictive model 
for the class of problem at hand (in this example, Attrition 
Propensity scoring), and to have recorded the key 



knowledge gained from this exercise in the form of such 
a reusable Application Template 206, such that less 
skilled users can benefit from this knowledge. 
[0043] The Application Template 206 records such 
s knowledge as: 

• what type of customers (or ex -customers) to use in 
building the model, 

• which Attributes and Measures are potential predic- 
10 tors of the target behavior (including any derived 

variables, recreated in the form of Derived Meas- 
ures). 

[0044] This knowledge is used as the starting point for 
*s the user, who adapts the Application Template 206 to 
the particular context of the business problem at hand. 
In addition to changing initial default values (e.g., the 
model accuracy requirement), this might include select- 
ing a customer segment as the input set, selecting a set 
20 of input variables (Attributes and Measures), and apply- 
ing the derived Measures. 

Example Application Template 

25 [0045] The following describes an example Applica- 
tion Template 206. 

[0046] Assume that a cellular phone company has 
been losing customers in California, due to increased 
competition in the state. The user develops a retention 
30 campaign to target current customers with a special of- 
fer aimed at retaining them by offering them an attractive 
1-year early renewal package. She selects the target 
group first by narrowing the "California Customers" Seg- 
ments down to a "Rich CA Customers" Sub-Segment, 
35 i.e., those who own 2 or more vehicles and who have a 
disposable income above $40,000. The "Rich CA Cus- 
tomers" Sub-Segment is further constrained by select- 
ing only the most profitable 50% of these customers, 
then ranking these customers on a compound calcula- 
te tion of profitability multiplied by likelihood of attrition, and 
selecting only the top 100,000 of these customers (i.e., 
those customers with high profitability and high likeli- 
hood of attrition). In creating the retention campaign, the 
user invokes a previously-built Attrition Propensity mod- 
45 el that predicts which customers are most likely to ter- 
minate their service with the company. This is illustrated 
in the following 7-step workflow. 
[0047] Referring to FIG. 4, an example Application 
Template 206 is shown, wherein the Application Tem- 
so plate 206 includes a workflow comprising a sequence 
of steps using icons. The sequence and data moves 
along the flow of the arrows linking the various icons. 
The logic of the Application Template 206 is described 
below in conjunction with FIG. 5: 

55 

Icon 400 represents a "Get a Segment" task for a 
Segment referred to as "Rich CA Customers," Seg- 
ment (Block 500). The "Rich CA Customers" Seg- 
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ment is comprised of 1 ,500,000 customers as indi- 
cated by the label. 

Icon 402 represents a "Constrain a Segment" task 
performed against the resulting Segment to create 
a Sub-Segment "2+ Vehicles" (Block 502). Before 5 
running the task, the Application Template 206 
prompts the user to optionally subset the list of At- 
tributes to be returned in the Sub-Segment (the in- 
itial default is "all," but this may be modified by the 
user). In this step, a Sub-Segment is selected of 
those "Rich CA Customers" who own 2 or more ve- 
hicles ("Vehicle Ownership" Attribute > 1 ") and have 
more than $40,000 in disposable income ("Dispos- 
able Income" Attribute > 40,000"), thereby yielding 
a Sub-Segment of 600,000 customers as indicated 
by the label. 

Icon 404 represents an "Apply Measures to Seg- 
ment" task performed against the Sub-Segment 
(Block 504). The Application Template 206 exe- 
cutes a "Profit" Measure against the Sub-Segment, 
which calculates the profitability of the customers 
(e.g., revenues minus expenses). 
Icon 406 represents an "Apply Measures to Seg- 
ment" task performed against the Sub-Segment 
(Block 506). The Application Template 206 exe- 
cutes a 'Top n%" Measure against the Sub-Seg- 
ment, which orders the customers by profitability 
and selects only the top 50% of the resulting or- 
dered customers, thereby yielding a Sub-Segment 
of 300,000 customers as indicated by the label. 
Icon 408 represents an "Apply Measures to Seg- 
ment" task being performed against the Sub-Seg- 
ment (Block408). The Application Template 206 ex- 
ecutes a rule induction tree against the Sub-Seg- 
ment to compute the Measure "Attrition Propensity" 
for each record therein ("Attrition Propensity" is the 
name given to output of the rule induction tree). In 
the preferred embodiment, the Measure "Attrition 
Propensity" is the predicted attrition propensity 
score, normalized to a range of 0 to 100. Although 
the Measure "Attrition Propensity" could be a pre- 
calculated score, it is calculated dynamically in the 
preferred embodiment by executing an Attrition 
Propensity predictive model against the Sub-Seg- 
ment. A key aspect of the present invention is this 
dynamic application of a predictive model to 
achieve such effects as dynamic scoring. 

[0048] In the preferred embodiment, the Measure "At- 
trition Propensity" is built as described in the co-pending 
and commonly-assigned Application Serial No. 
09/™,---, entitled "BUILDING PREDICTIVE MODELS 
WITHIN INTERACTIVE BUSINESS ANALYSIS PROC- 
ESSES," filed on same date herewith, by Ken O'Flaher- 
ty, attorney's docket number 8983. Specifically, the Ap- 
plication Template 206 builds the rule induction tree (al- 
so known as a decision tree model) by invoking a model- 
building algorithm (e.g., the analytic algorithm for rule 



induction) via the module 214 of the Object Manager 
208, wherein the module 214 accesses or invokes the 
application programming interface (API) to the TeraMin- 
er™ product and the analytic algorithm for rule induction 
as described in the co-pending and commonly-assigned 
Application Serial No. 09/410,530, filed on October 1, 
1999, by Todd M. Brye, entitled "SQL-Based Analytic 
Algorithm for Rule Induction," attorney's docket number 
8221, which application is incorporated by reference 
herein. 

[0049] The analytic algorithm for rule induction in the 
TeraMiner™ product uses the RDBMS 106 to retrieve 
counts and order data within the database 1 08, and then 
extracts the information to determine the rules or splits 
in the rule induction tree. The rule induction tree com- 
prises a predictive model (a formula in SQL) that pre- 
dicts the likelihood for attrition. One advantage to the 
rule induction tree is that it is invoked automatically and 
transparently by the user as part of the workflow in the 
Application Template 206 using familiar business terms. 
[0050] When the Measure "Attrition Propensity" is ex- 
ecuted, such that it is applied against the current Sub- 
Segment in the workflow, it dynamically estimates the 
propensity for attrition of each customer in the Sub-Seg- 
ment, using the previously-built predictive model (e.g., 
the rule induction model), thereby generating a propen- 
sity score for each customer. Thus, the attrition propen- 
sity estimate is fully up-to-date for each customer, be- 
cause it uses the current values for each of the input 
variables required by the model. In data mining termi- 
nology, this is an example of "model deployment," i.e., 
deploying or executing a previously-built predictive 
model. 

[0051] Icon 410 represents an "Apply Measures to 
Segment" task performed against the Sub-Segment 
(Block 510). The Application Template 206 executes a 
Top n" Measure against the Sub-Segment, which or- 
ders the customers by profitability and selects only the 
top 1 00,000 of the resulting ordered customers, thereby 
yielding a Sub-Segment of 100,000 customers as indi- 
cated by the label. 

[0052] At icon 412, the resulting Sub-Segment can be 
saved or persisted (Block 512). 
[0053] In one alternative embodiment, any type of 
computer could be used to implement the present in- 
vention. In addition, any database management system, 
decision support system, on-line analytic processing 
system, or other computer program that performs simi- 
lar functions could be used with the present invention. 
[0054] In summary, the present invention discloses a 
Customer Relationship Management (CRM) system 
that incorporates predictive models. The system is used 
by business users who are unfamiliar with the art of data 
mining. The predictive model, which is constructed by a 
model-building mechanism in a data mining subsystem, 
accepts the appropriate input attributes, performs cal- 
culations against a segment comprised of records, and 
generates an output attribute. 
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Claims 

1 . A method for using predictive models within a com- 
puter-implemented business analysis environment, 
comprising: 

(a) applying a derived measure against a seg- 
ment, wherein the derived measure comprises 
a predictive model that previously-built by a 
model-building mechanism in a data mining 
system; and 

(b) generating output for the segment from the 
predictive model in the form of measure values. 

2. The method of claim 1 , wherein the derived meas- 
ure is invoked within an application template, the 
application template comprises a sequence of ele- 
ments linked together in a workflow, and the ele- 
ments are selected from a group comprising a seg- 
ment, a filter, a measure and a function. 

3. The method of claim 2, wherein the application tem- 
plate is constructed in a visual programming envi- 
ronment. 

4. The method of claim 2, wherein the application tem- 
plates can be reused and/or modified by users. 

5. The method of claim 2, wherein a segment is a 
grouping of data elements from a database organ- 
ized about one or more attributes. 

6. The method of claim 2, wherein a filter defines one 
or more attribute constraints applied to a segment. 

7. The method of claim 2, wherein a profile is a labeled 
collection of attributes of a segment. 

8. The method of claim 2, wherein a measure is an 
expression applied to a segment. 

9. The method of any of claims 1 to 8, wherein the 
computer-implemented business analysis environ- 
ment includes an object model, and the segments, 
attributes, filters, and measures comprise objects. 

1 0. The method of claim 9, wherein operations upon the 
objects are translated into SQL statements that ac- 
cess corresponding tables and columns in a rela- 
tional database. 

11. The method of any of claims 1 to 10, wherein the 
predictive model comprises one or more SQL state- 
ments that access tables and columns in a relation- 
al database. 

12. The method of any of claims 1 to 10, wherein the 
predictive model comprises one or more statements 



executed by a database management system. 

1 3. The method of claim 1 2, wherein the statements ac- 
cess data stored in the database management sys- 

5 tern. 

14. The method of any of claims 1 to 13, wherein the 
model-building mechanism comprises an analytic 
algorithm for rule induction performed against data 

w stored in a database management system to create 
the predictive model. 

15. A computer-implemented system for using predic- 
tive models within a computer-implemented busi- 
es ness analysis environment, comprising: 

(a) means for applying a derived measure 
against a segment, wherein the derived meas- 
ure comprises a predictive model that previous- 

20 ly-built by a model-building mechanism in a da- 

ta mining system; and 

(b) means for generating output for the seg- 
ment from the predictive model in the form of 
measure values. 

25 

16. The system of claim 15, wherein the derived meas- 
ure is invoked within an application template, the 
application template comprises a sequence of ele- 
ments linked together in a workflow, and the ele- 

30 ments are selected from a group comprising a seg- 
ment, a filter, a measure and a function. 

17. The system of claim 16, wherein the application 
template is constructed in a visual programming en- 

35 vironment. 

18. The system of claim 16, wherein the application 
templates can be reused and/or modified by users. 

40 19. The system of claim 16, wherein a segment is a 
grouping of data elements from a database organ- 
ized about one or more attributes. 

20. The system of claim 1 6, wherein a filter defines one 
45 or more attribute constraints applied to a segment. 

21. The system of claim 16, wherein a profile is a la- 
beled collection of attributes of a segment. 

50 22. The system of claim 16, wherein a measure is an 
expression applied to a segment. 

23. The system of any of claims 15 to 22, wherein the 
computer-implemented business analysis environ- 

55 ment includes an object model, and the segments, 
attributes, filters, and measures comprise objects. 

24. The method of claim 23, wherein operations upon 
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the objects are translated into SQL statements that 
access corresponding tables and columns in a re- 
lational database. 

25. The system of any of claims 15 to 24, wherein the 5 
predictive model comprises one or more SQL state- 
ments that access tables and columns in a relation- 
al database. 

26. The system of any of claims 15 to 24, wherein the 10 
predictive model comprises one or more statements 
executed by a database management system. 

27. The system of claim 26, wherein the statements ac- 
cess data stored in the database management sys- 15 
tern. 

28. The system of any of claims 15 to 27, wherein the 
model-building mechanism comprises an analytic 
algorithm for rule induction performed against data 20 
stored in a database management system to create 

the predictive model. 

29. An article including one or more machine-readable 
storage media containing instructions for using pre- 25 
dictive models within a computer-implemented 
business analysis environment , the instructions 
when executed by a computer system causing the 
system to perform the method of any of claims 1 to 

14. 30 

30. A data structure stored in memory of a computer 
comprising instructions forf or using predictive mod- 
els within a computer-implemented business anal- 
ysis environment, which when executed cause the 35 
computer to perform the method of any of claims 1 

to 14. 

31. A computer program comprising program instruc- 
tions for using predictive models within a computer- 40 
implemented business analysis environment, the 
instructions when executed on a computer system 
causing the system to perform the method of any of 
claims 1 to 14. 
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